A hidden challenge of link prediction: which pairs to check?
نویسندگان
چکیده
The traditional setup of link prediction in networks assumes that a test set node pairs, which is usually balanced, available over to predict the presence links. However, practice, there no set: ground truth not known, so number possible pairs quadratic nodes graph. Moreover, because graphs are sparse, most these will be Thus, methods, often rely on proximity-preserving embeddings or heuristic notions similarity, face vast search space, with many close proximity, but should linked. To mitigate this issue, we introduce LinkWaldo, framework for choosing from quadratic, massively skewed space concise candidate that, addition being also structurally resemble observed edges. This allows it ignore some high-proximity low-resemblance and identify high-resemblance, lower-proximity pairs. Our built model theoretically combines stochastic block models (SBMs) proximity models. structure SBM maps out where new links expected fall, identifies plausible within blocks, using locality sensitive hashing avoid expensive exhaustive search. LinkWaldo can use any representation learning definition generate method, allowing power current future methods realized practice. We evaluate 13 across multiple domains show average returns sets containing 7–33% more missing than both embedding-based baselines’ sets. code at https://github.com/GemsLab/LinkWaldo.
منابع مشابه
hidden caries: a new challenge for dentists
purpose: hidden caries is a term used to describe occlusal dentine caries that is missed on a visual examination, but can be detected by bitewing radiographs. the aim of this study is the review of new studies about this lesion and presentation the ways for diagnosis, prevention and treatmen of this lesion. review of literature: investigations believe that the etiology of this lesion is the mor...
متن کاملHidden space reconstruction inspires link prediction in complex networks
Hao Liao, Mingyang Zhou, Zong-wen Wei, Rui Mao,Alexandre VidmerYi-Cheng Zhang 1 Guangdong Province Key Laboratory of Popular High Performance Computers, College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, P.R. China 2 Department of Physics, University of Fribourg, Chemin du Musée 3, CH-1700 Fribourg, Switzerland 3 Department of Modern Physics, University ...
متن کاملHidden link prediction based on node centrality and weak ties
Link prediction has been widely used to extract missing information, identify spurious interactions, evaluate network evolving mechanisms, and so on. In this context, similaritybased algorithms have become the mainstream. However, most of them take into account the contributions of each common neighbor equally to the connection likelihood of two nodes. This paper proposes a model for link predi...
متن کاملGrand Challenge: Model Check Software
Model checking has been successfully employed for verification of industrial hardware systems. Recently, model checking techniques have also enjoyed limited success in verifying software systems, viz., device drivers. However, there are several hurdles which must be overcome before model checking can be used to handle industrial-scale software systems. This article reviews some of the prominent...
متن کاملA Hidden Challenge to the Regulation of Data Surveillance
Profiling is a data surveillance technique which is little-understood and ill-documented, but increasingly used. It is a means of generating suspects or prospects from within a large population, and involves inferring a set of characteristics of a particular class of person from past experience, then searching data-holdings for individuals with a close fit to that set of characteristics. It is ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Knowledge and Information Systems
سال: 2022
ISSN: ['0219-3116', '0219-1377']
DOI: https://doi.org/10.1007/s10115-021-01632-x